feature: Add common TranscriptionModel interface for audio transcription #1484

mudabirhussain · 2024-10-07T11:58:25Z

Created TranscriptionModel interface that extends Model<AudioTranscriptionPrompt, AudioTranscriptionResponse>
Implemented call(AudioTranscriptionPrompt) method for better compatibility between OpenAI and Azure OpenAI transcription models
Added default convenience methods for handling Resource and AudioTranscriptionOptions to return transcription as a String

Resolution of this opened issue: #1478

- Created TranscriptionModel interface that extends Model<AudioTranscriptionPrompt, AudioTranscriptionResponse> - Implemented `call(AudioTranscriptionPrompt)` method for better compatibility between OpenAI and Azure OpenAI transcription models - Added default convenience methods for handling Resource and AudioTranscriptionOptions to return transcription as a String

habuma

This looks pretty much like what I had in mind. Well done.

piotrooo · 2024-10-07T21:08:54Z

Meanwhile, as an enrichment to transcription take a look at #1278

ThomasVitale · 2024-10-11T14:25:19Z

spring-ai-core/src/main/java/org/springframework/ai/model/TranscriptionModel.java

@@ -0,0 +1,22 @@
+package org.springframework.ai.model;


Should this interface be in the package org.springframework.ai.model.audio.transcription?

yes, it should.

Model interfaces should be placed in packages that reflect their functional domain:

For single-level domains:
org.springframework.ai.<domain>

For hierarchical domains:
org.springframework.ai.<category>.<subdomain>

Model Interface Package Location

EmbeddingModel org.springframework.ai.embedding

ModerationModel org.springframework.ai.moderation

TextToSpeechModel org.springframework.ai.audio.tts

kpavlov · 2024-10-22T17:10:02Z

Tests should be added too.
How did the build pass without test coverage 🤔

markpollack · 2025-07-14T20:53:16Z

Is there another transcription model we can use to verify the abstraction?

markpollack · 2025-07-14T22:31:55Z

I've added the additional tests to check the default methods etc.

Merged in 4cf2377

Thanks @mudabirhussain and others.

mudabirhussain added 2 commits October 7, 2024 15:56

Spring Java Format Fix

4c8ce35

habuma reviewed Oct 7, 2024

View reviewed changes

ThomasVitale reviewed Oct 11, 2024

View reviewed changes

markpollack added this to the 1.0.x milestone May 6, 2025

markpollack added the enhancement New feature or request label Jul 11, 2025

markpollack closed this Jul 14, 2025

markpollack self-assigned this Jul 14, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feature: Add common TranscriptionModel interface for audio transcription #1484

feature: Add common TranscriptionModel interface for audio transcription #1484

Uh oh!

mudabirhussain commented Oct 7, 2024

Uh oh!

habuma left a comment

Uh oh!

piotrooo commented Oct 7, 2024

Uh oh!

ThomasVitale Oct 11, 2024

Uh oh!

markpollack Jul 14, 2025

Uh oh!

kpavlov commented Oct 22, 2024 •

edited

Loading

Uh oh!

markpollack commented Jul 14, 2025

Uh oh!

markpollack commented Jul 14, 2025

Uh oh!

Uh oh!

Model Interface	Package Location
`EmbeddingModel`	`org.springframework.ai.embedding`
`ModerationModel`	`org.springframework.ai.moderation`
`TextToSpeechModel`	`org.springframework.ai.audio.tts`

feature: Add common TranscriptionModel interface for audio transcription #1484

feature: Add common TranscriptionModel interface for audio transcription #1484

Uh oh!

Conversation

mudabirhussain commented Oct 7, 2024

Uh oh!

habuma left a comment

Choose a reason for hiding this comment

Uh oh!

piotrooo commented Oct 7, 2024

Uh oh!

ThomasVitale Oct 11, 2024

Choose a reason for hiding this comment

Uh oh!

markpollack Jul 14, 2025

Choose a reason for hiding this comment

Uh oh!

kpavlov commented Oct 22, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

markpollack commented Jul 14, 2025

Uh oh!

markpollack commented Jul 14, 2025

Uh oh!

Uh oh!

kpavlov commented Oct 22, 2024 •

edited

Loading